Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 162 |
| Missing cells | 162 |
| Missing cells (%) | 7.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 16.6 KiB |
| Average record size in memory | 104.8 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 1 |
Country has a high cardinality: 162 distinct values | High cardinality |
Confirmed is highly correlated with Deaths and 2 other fields | High correlation |
Deaths is highly correlated with Confirmed and 1 other fields | High correlation |
Recovered is highly correlated with New_cases and 3 other fields | High correlation |
Active is highly correlated with Confirmed and 1 other fields | High correlation |
New_cases is highly correlated with Confirmed and 5 other fields | High correlation |
New_deaths is highly correlated with New_cases and 1 other fields | High correlation |
New_recovered is highly correlated with Recovered and 4 other fields | High correlation |
Total_Recovered is highly correlated with Recovered and 3 other fields | High correlation |
Recovered_Percent is highly correlated with Recovered and 3 other fields | High correlation |
Confirmed is highly correlated with Deaths and 7 other fields | High correlation |
Deaths is highly correlated with Confirmed and 7 other fields | High correlation |
Recovered is highly correlated with Confirmed and 7 other fields | High correlation |
Active is highly correlated with Confirmed and 7 other fields | High correlation |
New_cases is highly correlated with Confirmed and 7 other fields | High correlation |
New_deaths is highly correlated with Confirmed and 7 other fields | High correlation |
New_recovered is highly correlated with Confirmed and 7 other fields | High correlation |
Total_Recovered is highly correlated with Confirmed and 7 other fields | High correlation |
Recovered_Percent is highly correlated with Confirmed and 7 other fields | High correlation |
Confirmed is highly correlated with Deaths and 6 other fields | High correlation |
Deaths is highly correlated with Confirmed and 6 other fields | High correlation |
Recovered is highly correlated with Confirmed and 5 other fields | High correlation |
Active is highly correlated with Confirmed and 6 other fields | High correlation |
New_cases is highly correlated with Confirmed and 4 other fields | High correlation |
New_deaths is highly correlated with Confirmed and 4 other fields | High correlation |
New_recovered is highly correlated with Recovered and 4 other fields | High correlation |
Total_Recovered is highly correlated with Confirmed and 5 other fields | High correlation |
Recovered_Percent is highly correlated with Confirmed and 5 other fields | High correlation |
New_recovered is highly correlated with New_cases and 5 other fields | High correlation |
New_cases is highly correlated with New_recovered and 7 other fields | High correlation |
New_deaths is highly correlated with New_recovered and 6 other fields | High correlation |
Confirmed is highly correlated with New_recovered and 7 other fields | High correlation |
Recovered is highly correlated with New_recovered and 5 other fields | High correlation |
Population_(in_thousands)_total is highly correlated with New_cases | High correlation |
Total_Recovered is highly correlated with New_recovered and 5 other fields | High correlation |
Active is highly correlated with New_cases and 3 other fields | High correlation |
Deaths is highly correlated with Confirmed and 1 other fields | High correlation |
Recovered_Percent is highly correlated with New_recovered and 5 other fields | High correlation |
Confirmed has 18 (11.1%) missing values | Missing |
Deaths has 18 (11.1%) missing values | Missing |
Recovered has 18 (11.1%) missing values | Missing |
Active has 18 (11.1%) missing values | Missing |
New_cases has 18 (11.1%) missing values | Missing |
New_deaths has 18 (11.1%) missing values | Missing |
New_recovered has 18 (11.1%) missing values | Missing |
Total_Recovered has 18 (11.1%) missing values | Missing |
Recovered_Percent has 18 (11.1%) missing values | Missing |
Country is uniformly distributed | Uniform |
df_index has unique values | Unique |
Country has unique values | Unique |
Population_annual_growth_rate_(%) has 2 (1.2%) zeros | Zeros |
Deaths has 16 (9.9%) zeros | Zeros |
Recovered has 5 (3.1%) zeros | Zeros |
Active has 4 (2.5%) zeros | Zeros |
New_cases has 29 (17.9%) zeros | Zeros |
New_deaths has 80 (49.4%) zeros | Zeros |
New_recovered has 56 (34.6%) zeros | Zeros |
Total_Recovered has 5 (3.1%) zeros | Zeros |
Recovered_Percent has 5 (3.1%) zeros | Zeros |
Reproduction
| Analysis started | 2022-01-27 09:05:17.244459 |
|---|---|
| Analysis finished | 2022-01-27 09:05:33.517941 |
| Duration | 16.27 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 162 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 91.25925926 |
| Minimum | 0 |
|---|---|
| Maximum | 184 |
| Zeros | 1 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 9.05 |
| Q1 | 44.25 |
| median | 94.5 |
| Q3 | 136.75 |
| 95-th percentile | 173.95 |
| Maximum | 184 |
| Range | 184 |
| Interquartile range (IQR) | 92.5 |
Descriptive statistics
| Standard deviation | 53.93400102 |
|---|---|
| Coefficient of variation (CV) | 0.5909975762 |
| Kurtosis | -1.228570574 |
| Mean | 91.25925926 |
| Median Absolute Deviation (MAD) | 46.5 |
| Skewness | -0.01065380737 |
| Sum | 14784 |
| Variance | 2908.876467 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 184 | 1 | 0.6% |
| 44 | 1 | 0.6% |
| 64 | 1 | 0.6% |
| 63 | 1 | 0.6% |
| 62 | 1 | 0.6% |
| 60 | 1 | 0.6% |
| 59 | 1 | 0.6% |
| 58 | 1 | 0.6% |
| 57 | 1 | 0.6% |
| 56 | 1 | 0.6% |
| Other values (152) | 152 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 184 | 1 | |
| 183 | 1 | |
| 182 | 1 | |
| 181 | 1 | |
| 180 | 1 | |
| 178 | 1 | |
| 177 | 1 | |
| 175 | 1 | |
| 174 | 1 | |
| 173 | 1 |
| Distinct | 162 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 KiB |
| Dominican Republic | 1 |
|---|---|
| Mauritius | 1 |
| Mali | 1 |
| Fiji | 1 |
| Sweden | 1 |
| Other values (157) |
Length
| Max length | 32 |
|---|---|
| Median length | 7.5 |
| Mean length | 9.24691358 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1498 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 162 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Afghanistan |
|---|---|
| 2nd row | Albania |
| 3rd row | Algeria |
| 4th row | Andorra |
| 5th row | Angola |
Common Values
| Value | Count | Frequency (%) |
| Dominican Republic | 1 | 0.6% |
| Mauritius | 1 | 0.6% |
| Mali | 1 | 0.6% |
| Fiji | 1 | 0.6% |
| Sweden | 1 | 0.6% |
| Guatemala | 1 | 0.6% |
| Portugal | 1 | 0.6% |
| Iceland | 1 | 0.6% |
| Azerbaijan | 1 | 0.6% |
| Algeria | 1 | 0.6% |
| Other values (152) | 152 |
Length
| Value | Count | Frequency (%) |
| and | 6 | 2.7% |
| republic | 4 | 1.8% |
| saint | 3 | 1.4% |
| islands | 3 | 1.4% |
| guinea | 3 | 1.4% |
| rep | 3 | 1.4% |
| new | 3 | 1.4% |
| congo | 2 | 0.9% |
| netherlands | 2 | 0.9% |
| china | 2 | 0.9% |
| Other values (187) | 188 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 223 | |
| i | 128 | 8.5% |
| n | 113 | 7.5% |
| e | 109 | 7.3% |
| o | 89 | 5.9% |
| r | 84 | 5.6% |
| l | 57 | 3.8% |
| 57 | 3.8% | |
| u | 53 | 3.5% |
| t | 48 | 3.2% |
| Other values (45) | 537 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1212 | |
| Uppercase Letter | 213 | 14.2% |
| Space Separator | 57 | 3.8% |
| Other Punctuation | 12 | 0.8% |
| Dash Punctuation | 2 | 0.1% |
| Open Punctuation | 1 | 0.1% |
| Close Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 223 | |
| i | 128 | |
| n | 113 | |
| e | 109 | |
| o | 89 | 7.3% |
| r | 84 | 6.9% |
| l | 57 | 4.7% |
| u | 53 | 4.4% |
| t | 48 | 4.0% |
| s | 47 | 3.9% |
| Other values (16) | 261 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 24 | |
| C | 20 | 9.4% |
| B | 19 | 8.9% |
| M | 19 | 8.9% |
| N | 14 | 6.6% |
| A | 13 | 6.1% |
| G | 13 | 6.1% |
| P | 12 | 5.6% |
| L | 12 | 5.6% |
| R | 11 | 5.2% |
| Other values (12) | 56 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5 | |
| . | 5 | |
| ' | 2 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 57 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1425 | |
| Common | 73 | 4.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 223 | |
| i | 128 | 9.0% |
| n | 113 | 7.9% |
| e | 109 | 7.6% |
| o | 89 | 6.2% |
| r | 84 | 5.9% |
| l | 57 | 4.0% |
| u | 53 | 3.7% |
| t | 48 | 3.4% |
| s | 47 | 3.3% |
| Other values (38) | 474 |
Common
| Value | Count | Frequency (%) |
| 57 | ||
| , | 5 | 6.8% |
| . | 5 | 6.8% |
| ' | 2 | 2.7% |
| - | 2 | 2.7% |
| ( | 1 | 1.4% |
| ) | 1 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1498 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 223 | |
| i | 128 | 8.5% |
| n | 113 | 7.5% |
| e | 109 | 7.3% |
| o | 89 | 5.9% |
| r | 84 | 5.6% |
| l | 57 | 3.8% |
| 57 | 3.8% | |
| u | 53 | 3.5% |
| t | 48 | 3.2% |
| Other values (45) | 537 |
| Distinct | 155 |
|---|---|
| Distinct (%) | 95.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12181.80387 |
| Minimum | 2 |
|---|---|
| Maximum | 160943 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 68.3 |
| Q1 | 913 |
| median | 5409 |
| Q3 | 12661.46108 |
| 95-th percentile | 39442.75 |
| Maximum | 160943 |
| Range | 160941 |
| Interquartile range (IQR) | 11748.46108 |
Descriptive statistics
| Standard deviation | 23688.70406 |
|---|---|
| Coefficient of variation (CV) | 1.944597393 |
| Kurtosis | 23.36785329 |
| Mean | 12181.80387 |
| Median Absolute Deviation (MAD) | 4901.5 |
| Skewness | 4.508260728 |
| Sum | 1973452.228 |
| Variance | 561154699.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12661.46108 | 7 | 4.3% |
| 739 | 2 | 1.2% |
| 19159 | 1 | 0.6% |
| 1328 | 1 | 0.6% |
| 100 | 1 | 0.6% |
| 5743 | 1 | 0.6% |
| 6640 | 1 | 0.6% |
| 6969 | 1 | 0.6% |
| 455 | 1 | 0.6% |
| 5259 | 1 | 0.6% |
| Other values (145) | 145 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 10 | 1 | |
| 14 | 1 | |
| 20 | 1 | |
| 31 | 1 | |
| 33 | 1 | |
| 50 | 1 | |
| 58 | 1 | |
| 68 | 1 | |
| 74 | 1 |
| Value | Count | Frequency (%) |
| 160943 | 1 | |
| 155991 | 1 | |
| 144720 | 1 | |
| 86264 | 1 | |
| 81021 | 1 | |
| 60644 | 1 | |
| 48379 | 1 | |
| 45558 | 1 | |
| 39459 | 1 | |
| 39134 | 1 |
| Distinct | 50 |
|---|---|
| Distinct (%) | 30.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.347164929 |
| Minimum | -2.5 |
|---|---|
| Maximum | 4.3 |
| Zeros | 2 |
| Zeros (%) | 1.2% |
| Negative | 16 |
| Negative (%) | 9.9% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | -2.5 |
|---|---|
| 5-th percentile | -0.495 |
| Q1 | 0.5 |
| median | 1.377245509 |
| Q3 | 2.1 |
| 95-th percentile | 3.1 |
| Maximum | 4.3 |
| Range | 6.8 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 1.179776256 |
|---|---|
| Coefficient of variation (CV) | 0.8757474529 |
| Kurtosis | 0.3310562222 |
| Mean | 1.347164929 |
| Median Absolute Deviation (MAD) | 0.822754491 |
| Skewness | -0.1632871861 |
| Sum | 218.2407186 |
| Variance | 1.391872013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.377245509 | 7 | 4.3% |
| 0.5 | 7 | 4.3% |
| 0.3 | 7 | 4.3% |
| 1.1 | 7 | 4.3% |
| 1.7 | 7 | 4.3% |
| 2.5 | 6 | 3.7% |
| 0.6 | 6 | 3.7% |
| 1.8 | 6 | 3.7% |
| 1.3 | 5 | 3.1% |
| 2.2 | 5 | 3.1% |
| Other values (40) | 99 |
| Value | Count | Frequency (%) |
| -2.5 | 1 | 0.6% |
| -2.2 | 1 | 0.6% |
| -1.1 | 2 | |
| -0.9 | 1 | 0.6% |
| -0.7 | 1 | 0.6% |
| -0.6 | 1 | 0.6% |
| -0.5 | 2 | |
| -0.4 | 1 | 0.6% |
| -0.3 | 4 | |
| -0.1 | 2 |
| Value | Count | Frequency (%) |
| 4.3 | 1 | 0.6% |
| 4 | 1 | 0.6% |
| 3.9 | 2 | |
| 3.6 | 1 | 0.6% |
| 3.5 | 1 | 0.6% |
| 3.3 | 1 | 0.6% |
| 3.2 | 1 | 0.6% |
| 3.1 | 3 | |
| 3 | 4 | |
| 2.9 | 1 | 0.6% |
Confirmed
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 142 |
|---|---|
| Distinct (%) | 98.6% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16136.09722 |
| Minimum | 10 |
|---|---|
| Maximum | 301708 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 24.45 |
| Q1 | 824.5 |
| median | 3133.5 |
| Q3 | 15701.5 |
| 95-th percentile | 66995.8 |
| Maximum | 301708 |
| Range | 301698 |
| Interquartile range (IQR) | 14877 |
Descriptive statistics
| Standard deviation | 33081.59998 |
|---|---|
| Coefficient of variation (CV) | 2.050161172 |
| Kurtosis | 38.91159733 |
| Mean | 16136.09722 |
| Median Absolute Deviation (MAD) | 3047.5 |
| Skewness | 5.17137023 |
| Sum | 2323598 |
| Variance | 1094392258 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 86 | 2 | 1.2% |
| 10621 | 2 | 1.2% |
| 79395 | 1 | 0.6% |
| 114 | 1 | 0.6% |
| 64156 | 1 | 0.6% |
| 509 | 1 | 0.6% |
| 462 | 1 | 0.6% |
| 674 | 1 | 0.6% |
| 11424 | 1 | 0.6% |
| 24 | 1 | 0.6% |
| Other values (132) | 132 | |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | |
| 12 | 1 | |
| 14 | 1 | |
| 17 | 1 | |
| 18 | 1 | |
| 20 | 1 | |
| 23 | 1 | |
| 24 | 1 | |
| 27 | 1 | |
| 48 | 1 |
| Value | Count | Frequency (%) |
| 301708 | 1 | |
| 116458 | 1 | |
| 92482 | 1 | |
| 82040 | 1 | |
| 81161 | 1 | |
| 79395 | 1 | |
| 71181 | 1 | |
| 67096 | 1 | |
| 66428 | 1 | |
| 64379 | 1 |
Deaths
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 111 |
|---|---|
| Distinct (%) | 77.1% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 869.7361111 |
| Minimum | 0 |
|---|---|
| Maximum | 45844 |
| Zeros | 16 |
| Zeros (%) | 9.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 11 |
| median | 59.5 |
| Q3 | 333.5 |
| 95-th percentile | 2580.85 |
| Maximum | 45844 |
| Range | 45844 |
| Interquartile range (IQR) | 322.5 |
Descriptive statistics
| Standard deviation | 4044.85707 |
|---|---|
| Coefficient of variation (CV) | 4.650671644 |
| Kurtosis | 108.7461998 |
| Mean | 869.7361111 |
| Median Absolute Deviation (MAD) | 58.5 |
| Skewness | 9.921162681 |
| Sum | 125242 |
| Variance | 16360868.71 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16 | 9.9% |
| 11 | 4 | 2.5% |
| 7 | 3 | 1.9% |
| 2 | 3 | 1.9% |
| 1 | 3 | 1.9% |
| 10 | 3 | 1.9% |
| 8 | 3 | 1.9% |
| 22 | 2 | 1.2% |
| 35 | 2 | 1.2% |
| 69 | 2 | 1.2% |
| Other values (101) | 103 | |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 16 | |
| 1 | 3 | 1.9% |
| 2 | 3 | 1.9% |
| 3 | 1 | 0.6% |
| 4 | 1 | 0.6% |
| 5 | 1 | 0.6% |
| 6 | 1 | 0.6% |
| 7 | 3 | 1.9% |
| 8 | 3 | 1.9% |
| 10 | 3 | 1.9% |
| Value | Count | Frequency (%) |
| 45844 | 1 | |
| 9822 | 1 | |
| 8944 | 1 | |
| 6160 | 1 | |
| 5700 | 1 | |
| 5532 | 1 | |
| 4652 | 1 | |
| 2647 | 1 | |
| 2206 | 1 | |
| 1978 | 1 |
Recovered
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 136 |
|---|---|
| Distinct (%) | 94.4% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7084.340278 |
| Minimum | 0 |
|---|---|
| Maximum | 55057 |
| Zeros | 5 |
| Zeros (%) | 3.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12.15 |
| Q1 | 279.5 |
| median | 1548 |
| Q3 | 6587.75 |
| 95-th percentile | 34540.7 |
| Maximum | 55057 |
| Range | 55057 |
| Interquartile range (IQR) | 6308.25 |
Descriptive statistics
| Standard deviation | 11256.0658 |
|---|---|
| Coefficient of variation (CV) | 1.588865774 |
| Kurtosis | 3.320876643 |
| Mean | 7084.340278 |
| Median Absolute Deviation (MAD) | 1523.5 |
| Skewness | 1.978748305 |
| Sum | 1020145 |
| Variance | 126699017.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5 | 3.1% |
| 18 | 2 | 1.2% |
| 128 | 2 | 1.2% |
| 803 | 2 | 1.2% |
| 39 | 2 | 1.2% |
| 32455 | 1 | 0.6% |
| 22 | 1 | 0.6% |
| 32856 | 1 | 0.6% |
| 440 | 1 | 0.6% |
| 1616 | 1 | 0.6% |
| Other values (126) | 126 | |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 8 | 1 | 0.6% |
| 11 | 1 | 0.6% |
| 12 | 1 | 0.6% |
| 13 | 1 | 0.6% |
| 15 | 1 | 0.6% |
| 18 | 2 | 1.2% |
| 19 | 1 | 0.6% |
| 22 | 1 | 0.6% |
| 23 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 55057 | 1 | |
| 45692 | 1 | |
| 37202 | 1 | |
| 36110 | 1 | |
| 35375 | 1 | |
| 35086 | 1 | |
| 34896 | 1 | |
| 34838 | 1 | |
| 32856 | 1 | |
| 32455 | 1 |
Active
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 132 |
|---|---|
| Distinct (%) | 91.7% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8182.020833 |
| Minimum | 0 |
|---|---|
| Maximum | 254427 |
| Zeros | 4 |
| Zeros (%) | 2.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 90.25 |
| median | 785 |
| Q3 | 4708 |
| 95-th percentile | 40496.15 |
| Maximum | 254427 |
| Range | 254427 |
| Interquartile range (IQR) | 4617.75 |
Descriptive statistics
| Standard deviation | 25533.98103 |
|---|---|
| Coefficient of variation (CV) | 3.120742607 |
| Kurtosis | 62.29309013 |
| Mean | 8182.020833 |
| Median Absolute Deviation (MAD) | 776.5 |
| Skewness | 7.065655334 |
| Sum | 1178211 |
| Variance | 651984187.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4 | 2.5% |
| 1 | 3 | 1.9% |
| 2 | 3 | 1.9% |
| 21 | 2 | 1.2% |
| 52 | 2 | 1.2% |
| 1599 | 2 | 1.2% |
| 9 | 2 | 1.2% |
| 13 | 2 | 1.2% |
| 476 | 1 | 0.6% |
| 179 | 1 | 0.6% |
| Other values (122) | 122 | |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 1 | 3 | |
| 2 | 3 | |
| 4 | 1 | 0.6% |
| 8 | 1 | 0.6% |
| 9 | 2 | |
| 12 | 1 | 0.6% |
| 13 | 2 | |
| 15 | 1 | 0.6% |
| 18 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 254427 | 1 | |
| 107514 | 1 | |
| 73695 | 1 | |
| 53649 | 1 | |
| 52992 | 1 | |
| 47064 | 1 | |
| 47056 | 1 | |
| 40733 | 1 | |
| 39154 | 1 | |
| 36378 | 1 |
New_cases
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 88 |
|---|---|
| Distinct (%) | 61.1% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 179.8680556 |
| Minimum | 0 |
|---|---|
| Maximum | 2029 |
| Zeros | 29 |
| Zeros (%) | 17.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2.75 |
| median | 24 |
| Q3 | 159 |
| 95-th percentile | 724.55 |
| Maximum | 2029 |
| Range | 2029 |
| Interquartile range (IQR) | 156.25 |
Descriptive statistics
| Standard deviation | 342.2070335 |
|---|---|
| Coefficient of variation (CV) | 1.902544799 |
| Kurtosis | 10.46339334 |
| Mean | 179.8680556 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | 2.995285886 |
| Sum | 25901 |
| Variance | 117105.6538 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 29 | 17.9% |
| 1 | 5 | 3.1% |
| 11 | 4 | 2.5% |
| 13 | 3 | 1.9% |
| 7 | 3 | 1.9% |
| 24 | 3 | 1.9% |
| 10 | 3 | 1.9% |
| 4 | 3 | 1.9% |
| 3 | 3 | 1.9% |
| 5 | 3 | 1.9% |
| Other values (78) | 85 | |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 29 | |
| 1 | 5 | 3.1% |
| 2 | 2 | 1.2% |
| 3 | 3 | 1.9% |
| 4 | 3 | 1.9% |
| 5 | 3 | 1.9% |
| 6 | 2 | 1.2% |
| 7 | 3 | 1.9% |
| 8 | 1 | 0.6% |
| 9 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 2029 | 1 | |
| 1752 | 1 | |
| 1592 | 1 | |
| 1248 | 1 | |
| 1146 | 1 | |
| 1104 | 1 | |
| 835 | 1 | |
| 731 | 1 | |
| 688 | 1 | |
| 682 | 1 |
New_deaths
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 23 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.638888889 |
| Minimum | 0 |
|---|---|
| Maximum | 64 |
| Zeros | 80 |
| Zeros (%) | 49.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3 |
| 95-th percentile | 16.85 |
| Maximum | 64 |
| Range | 64 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 8.904003273 |
|---|---|
| Coefficient of variation (CV) | 2.446901663 |
| Kurtosis | 22.46227029 |
| Mean | 3.638888889 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.375050525 |
| Sum | 524 |
| Variance | 79.28127428 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 80 | |
| 1 | 14 | 8.6% |
| 2 | 9 | 5.6% |
| 3 | 7 | 4.3% |
| 4 | 5 | 3.1% |
| 6 | 5 | 3.1% |
| 5 | 4 | 2.5% |
| 11 | 3 | 1.9% |
| 13 | 2 | 1.2% |
| 7 | 2 | 1.2% |
| Other values (13) | 13 | 8.0% |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 80 | |
| 1 | 14 | 8.6% |
| 2 | 9 | 5.6% |
| 3 | 7 | 4.3% |
| 4 | 5 | 3.1% |
| 5 | 4 | 2.5% |
| 6 | 5 | 3.1% |
| 7 | 2 | 1.2% |
| 8 | 1 | 0.6% |
| 9 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 64 | 1 | |
| 50 | 1 | |
| 46 | 1 | |
| 28 | 1 | |
| 27 | 1 | |
| 20 | 1 | |
| 19 | 1 | |
| 17 | 1 | |
| 16 | 1 | |
| 14 | 1 |
New_recovered
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 70 |
|---|---|
| Distinct (%) | 48.6% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 107.6666667 |
| Minimum | 0 |
|---|---|
| Maximum | 1601 |
| Zeros | 56 |
| Zeros (%) | 34.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 5 |
| Q3 | 103.5 |
| 95-th percentile | 672.75 |
| Maximum | 1601 |
| Range | 1601 |
| Interquartile range (IQR) | 103.5 |
Descriptive statistics
| Standard deviation | 233.9982368 |
|---|---|
| Coefficient of variation (CV) | 2.173358237 |
| Kurtosis | 14.54854139 |
| Mean | 107.6666667 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 3.488900208 |
| Sum | 15504 |
| Variance | 54755.17483 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 56 | |
| 2 | 5 | 3.1% |
| 4 | 4 | 2.5% |
| 1 | 4 | 2.5% |
| 6 | 3 | 1.9% |
| 39 | 2 | 1.2% |
| 15 | 2 | 1.2% |
| 103 | 2 | 1.2% |
| 3 | 2 | 1.2% |
| 22 | 2 | 1.2% |
| Other values (60) | 62 | |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 56 | |
| 1 | 4 | 2.5% |
| 2 | 5 | 3.1% |
| 3 | 2 | 1.2% |
| 4 | 4 | 2.5% |
| 5 | 2 | 1.2% |
| 6 | 3 | 1.9% |
| 8 | 1 | 0.6% |
| 11 | 2 | 1.2% |
| 14 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 1601 | 1 | |
| 1007 | 1 | |
| 955 | 1 | |
| 843 | 1 | |
| 829 | 1 | |
| 749 | 1 | |
| 684 | 1 | |
| 681 | 1 | |
| 626 | 1 | |
| 558 | 1 |
Total_Recovered
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 136 |
|---|---|
| Distinct (%) | 94.4% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7192.006944 |
| Minimum | 0 |
|---|---|
| Maximum | 55741 |
| Zeros | 5 |
| Zeros (%) | 3.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12.15 |
| Q1 | 281 |
| median | 1594.5 |
| Q3 | 6925.25 |
| 95-th percentile | 34656.3 |
| Maximum | 55741 |
| Range | 55741 |
| Interquartile range (IQR) | 6644.25 |
Descriptive statistics
| Standard deviation | 11402.35621 |
|---|---|
| Coefficient of variation (CV) | 1.585420633 |
| Kurtosis | 3.281412196 |
| Mean | 7192.006944 |
| Median Absolute Deviation (MAD) | 1570 |
| Skewness | 1.971340951 |
| Sum | 1035649 |
| Variance | 130013727 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5 | 3.1% |
| 128 | 2 | 1.2% |
| 18 | 2 | 1.2% |
| 39 | 2 | 1.2% |
| 803 | 2 | 1.2% |
| 4301 | 1 | 0.6% |
| 127 | 1 | 0.6% |
| 11 | 1 | 0.6% |
| 189 | 1 | 0.6% |
| 104 | 1 | 0.6% |
| Other values (126) | 126 | |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 8 | 1 | 0.6% |
| 11 | 1 | 0.6% |
| 12 | 1 | 0.6% |
| 13 | 1 | 0.6% |
| 15 | 1 | 0.6% |
| 18 | 2 | 1.2% |
| 19 | 1 | 0.6% |
| 22 | 1 | 0.6% |
| 23 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 55741 | 1 | |
| 45863 | 1 | |
| 37519 | 1 | |
| 36531 | 1 | |
| 36041 | 1 | |
| 35845 | 1 | |
| 35533 | 1 | |
| 34896 | 1 | |
| 33298 | 1 | |
| 32959 | 1 |
Recovered_Percent
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 136 |
|---|---|
| Distinct (%) | 94.4% |
| Missing | 18 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71.92006944 |
| Minimum | 0 |
|---|---|
| Maximum | 557.41 |
| Zeros | 5 |
| Zeros (%) | 3.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1215 |
| Q1 | 2.81 |
| median | 15.945 |
| Q3 | 69.2525 |
| 95-th percentile | 346.563 |
| Maximum | 557.41 |
| Range | 557.41 |
| Interquartile range (IQR) | 66.4425 |
Descriptive statistics
| Standard deviation | 114.0235621 |
|---|---|
| Coefficient of variation (CV) | 1.585420633 |
| Kurtosis | 3.281412196 |
| Mean | 71.92006944 |
| Median Absolute Deviation (MAD) | 15.7 |
| Skewness | 1.971340951 |
| Sum | 10356.49 |
| Variance | 13001.3727 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5 | 3.1% |
| 1.28 | 2 | 1.2% |
| 8.03 | 2 | 1.2% |
| 0.39 | 2 | 1.2% |
| 0.18 | 2 | 1.2% |
| 1.89 | 1 | 0.6% |
| 0.23 | 1 | 0.6% |
| 557.41 | 1 | 0.6% |
| 19.24 | 1 | 0.6% |
| 1.93 | 1 | 0.6% |
| Other values (126) | 126 | |
| (Missing) | 18 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 0.08 | 1 | 0.6% |
| 0.11 | 1 | 0.6% |
| 0.12 | 1 | 0.6% |
| 0.13 | 1 | 0.6% |
| 0.15 | 1 | 0.6% |
| 0.18 | 2 | 1.2% |
| 0.19 | 1 | 0.6% |
| 0.22 | 1 | 0.6% |
| 0.23 | 1 | 0.6% |
| Value | Count | Frequency (%) |
| 557.41 | 1 | |
| 458.63 | 1 | |
| 375.19 | 1 | |
| 365.31 | 1 | |
| 360.41 | 1 | |
| 358.45 | 1 | |
| 355.33 | 1 | |
| 348.96 | 1 | |
| 332.98 | 1 | |
| 329.59 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | Country | Population_(in_thousands)_total | Population_annual_growth_rate_(%) | Confirmed | Deaths | Recovered | Active | New_cases | New_deaths | New_recovered | Total_Recovered | Recovered_Percent | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | Afghanistan | 26088.0 | 4.0 | 36263.0 | 1269.0 | 25198.0 | 9796.0 | 106.0 | 10.0 | 18.0 | 25216.0 | 252.16 |
| 1 | 1 | Albania | 3172.0 | 0.6 | 4880.0 | 144.0 | 2745.0 | 1991.0 | 117.0 | 6.0 | 63.0 | 2808.0 | 28.08 |
| 2 | 2 | Algeria | 33351.0 | 1.5 | 27973.0 | 1163.0 | 18837.0 | 7973.0 | 616.0 | 8.0 | 749.0 | 19586.0 | 195.86 |
| 3 | 3 | Andorra | 74.0 | 1.0 | 907.0 | 52.0 | 803.0 | 52.0 | 10.0 | 0.0 | 0.0 | 803.0 | 8.03 |
| 4 | 4 | Angola | 16557.0 | 2.8 | 950.0 | 41.0 | 242.0 | 667.0 | 18.0 | 1.0 | 0.0 | 242.0 | 2.42 |
| 5 | 5 | Antigua and Barbuda | 84.0 | 1.3 | 86.0 | 3.0 | 65.0 | 18.0 | 4.0 | 0.0 | 5.0 | 70.0 | 0.70 |
| 6 | 6 | Argentina | 39134.0 | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 7 | Armenia | 3010.0 | -0.3 | 37390.0 | 711.0 | 26665.0 | 10014.0 | 73.0 | 6.0 | 187.0 | 26852.0 | 268.52 |
| 8 | 9 | Austria | 8327.0 | 0.4 | 20558.0 | 713.0 | 18246.0 | 1599.0 | 86.0 | 1.0 | 37.0 | 18283.0 | 182.83 |
| 9 | 10 | Azerbaijan | 8406.0 | 0.6 | 30446.0 | 423.0 | 23242.0 | 6781.0 | 396.0 | 6.0 | 558.0 | 23800.0 | 238.00 |
Last rows
| df_index | Country | Population_(in_thousands)_total | Population_annual_growth_rate_(%) | Confirmed | Deaths | Recovered | Active | New_cases | New_deaths | New_recovered | Total_Recovered | Recovered_Percent | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 152 | 173 | Sweden | 9078.0 | 0.4 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 153 | 174 | Switzerland | 7455.0 | 0.4 | 1128.0 | 2.0 | 986.0 | 140.0 | 13.0 | 0.0 | 4.0 | 990.0 | 9.90 |
| 154 | 175 | Syria | 19408.0 | 2.7 | 67096.0 | 1636.0 | 37202.0 | 28258.0 | 835.0 | 11.0 | 317.0 | 37519.0 | 375.19 |
| 155 | 177 | Tajikistan | 6640.0 | 1.4 | 301708.0 | 45844.0 | 1437.0 | 254427.0 | 688.0 | 7.0 | 3.0 | 1440.0 | 14.40 |
| 156 | 178 | Tanzania | 39459.0 | 2.5 | 1202.0 | 35.0 | 951.0 | 216.0 | 10.0 | 1.0 | 3.0 | 954.0 | 9.54 |
| 157 | 180 | Timor-Leste | 1114.0 | 4.3 | 15988.0 | 146.0 | 9959.0 | 5883.0 | 525.0 | 4.0 | 213.0 | 10172.0 | 101.72 |
| 158 | 181 | Togo | 6410.0 | 2.7 | 431.0 | 0.0 | 365.0 | 66.0 | 11.0 | 0.0 | 0.0 | 365.0 | 3.65 |
| 159 | 182 | Tonga | 100.0 | 0.5 | 10621.0 | 78.0 | 3752.0 | 6791.0 | 152.0 | 2.0 | 0.0 | 3752.0 | 37.52 |
| 160 | 183 | Trinidad and Tobago | 1328.0 | 0.4 | 10.0 | 1.0 | 8.0 | 1.0 | 0.0 | 0.0 | 0.0 | 8.0 | 0.08 |
| 161 | 184 | Tunisia | 10215.0 | 1.1 | 1691.0 | 483.0 | 833.0 | 375.0 | 10.0 | 4.0 | 36.0 | 869.0 | 8.69 |